4356

337 samples and 7577 SNPs from across the northern gulf and western Atlantic. 4356 SNPs after LD thinning


population structure

First, just run a basic PCA:

We get clustering by shallow and deep/ nearshore and offshore, rather than Atlantic vs. Gulf. Suggesting that nearshore in the Atlantic and Gulf are more similar that geographic promixmate animals nearshore and offshore.


The patterns above maybe look like an inversion or something, but the PCA loadings don’t indicate this:


admixture

Sorting the plot below by Atlantic on the left, Gulf on the right. Then shallow to deep by collection location

Note that the left group in the Atlantic are the nearshore individuals, the bottom left in the PCAs above.

ordered shallow to deep, following subset for unequal sample sizes:

PCA admixture populations

NJ tree of pairwise genetic distances

Population assignment conclusion

Both Admixture and DAPC (not shown) indicate 4 populations as the most likely. They also agree with their individual assignments for all but ~5 individuals, who are all fairly admixed.

Isolation by distance:

Next, testing if there is isolation by distance at various scales.

Across all individuals, there is an IBD signal. But I think this is likely driven by underlying comparisons.

Split this out into each individual population across both regions. Within a population we generally see IBD, except for Atlantic Offshore. Inter-population comparisons have no signal.

Can also split into just the Atlantic of Gulf:

And split into Coastal, Intermediate, and Offshore:

This shows that the IBD pattern when all individuals are compared is driven primarily by within population comparisons and IDB is not really, or perhaps only weakly, present between the gulf and the atlantic.


Introgression and hybrids

The Intermediate individuals are possibly hybrids, given the results above they’re intermediate in genetic distance for nearly all analyses. Here, I’ll test this using: 1. f3 statistics 2. treemix 3. triangle plots 4. new hybrids

f3 statistics:

The f3-statistic explicitly tests whether a taxon of interest C results from admixture between two others (A and B): A significantly negative f3-statistic supports the admixture hypothesis, while a positive value is not informative. In our case, our taxon of interest C is Intermediate while A and B are Coastal and Offshore.

First, I calculated these statistics with the four populations assignments:

F3-Statistics with Four Populations
pop1 pop2 pop3 est se z p
Intermediate Coastal_Atl Offshore 0.0037947 0.0006491 5.8461604 0.0000000
Intermediate Coastal_Gulf Offshore -0.0005488 0.0005773 -0.9506458 0.3417842

Remember, positive values are not informative, negative values indicate a population resulting from admixture. No significance here.

Next, I split into 6 populations:

F3-Statistics with Six Populations
pop1 pop2 pop3 est se z p
Intermediate Atlantic
Intermediate_Atlantic Coastal_Atl Offshore_Atlantic 0.0057962 0.0007623 7.6034085 0.0000000
Intermediate_Atlantic Coastal_Atl Offshore_Gulf 0.0057595 0.0006787 8.4857907 0.0000000
Intermediate_Atlantic Coastal_Gulf Offshore_Atlantic 0.0006596 0.0006667 0.9893106 0.3225112
Intermediate_Atlantic Coastal_Gulf Offshore_Gulf 0.0020004 0.0005960 3.3563314 0.0007898
Intermediate Gulf
Intermediate_Gulf Coastal_Atl Offshore_Atlantic 0.0004012 0.0007493 0.5354332 0.5923504
Intermediate_Gulf Coastal_Atl Offshore_Gulf 0.0002826 0.0006526 0.4330949 0.6649458
Intermediate_Gulf Coastal_Gulf Offshore_Atlantic -0.0044799 0.0006952 -6.4443067 0.0000000
Intermediate_Gulf Coastal_Gulf Offshore_Gulf -0.0032210 0.0006094 -5.2856212 0.0000001

This indicates that the intermediate Gulf population is a result of admixture between the coastal gulf and both the Offshore Atlantic and Gulf populations. There is no evidence in the Atlantic Intermediate population.

f4 statistics:

NOT DONE YET!!!!!! use admixtools to do this. or treemix.

Treemix

Maximum likelihood tree estimating drift among populations. Migration edges are fit to the tree to improve populations that are a poor fit to the model. Migration gets addes stepwise. You can estimate the number of migration events that improves the model fit best, similar to structure evanno type approaches.

no migration

First fit the trees with 0 migration events:

Four populations, no migration eventsSix populations, no migration events

Adding migration events:

Best number of migrations events:

  • Four populations: 2 migrations events
  • Six populations: 4 migration events

The four population result is consistent and clear. There are two migration edges, between intermediate and offshore and the node of intermediate/Coastal Gulf and offshore. This tree is well supported and consistent across runs (100 runs, nearly all show this exact tree, below).

In contrast, with 6 populations, things are much more uncertain/unstable. For the most likely tree (top left, in figure below) there are migration edges between the intermediate pops and the branch leading to coastal populations. There is also migration from the node of coastal Gulf/intermediate with both offshore populations. The next 5 most likely trees show similar variations on these migration events. Note that the position of the coastal and intermediate populations are unstable across runs. This maybe isn’t shocking given that these populations aren’t well supported in the other analyses.

Results that are consistent:

  • Offshore Gulf and Offshore Atl are always sister
  • There is no migration with the coastal Atlantic population.
  • there is ample migration between Intermediate and offshore populations.